Document Search Practices
نویسندگان
چکیده
IntroductIon A large portion of the knowledge of most organizations is contained in electronic documents. For users to get pertinent information from the accumulation of stored documents, they need effective document retrieval systems. Unfortunately, electronic document management has fallen into the same trap that electronic data processing fell into: simply automating what previously was done manually. Paper documents were stored in folders in drawers in file cabinets. Electronic documents are stored in folders in directories on disk drives. The ability to find a document depends on the logic of the filing system, how familiar the individual is with the filing system, and how familiar the individual is with the problem domain of the item being sought. Some persons (e.g., research librarians) are much better than others at organizing and retrieving documents. Rarely, however, is a manager an expert at either storing or retrieving documents. Unfortunately, many electronic filing systems are set up by managers with little or no training on how to organize a filing system, and few tools, other than the Windows Search command, are available to help managers find documents that have been filed. The filing systems for libraries and knowledge management systems are more sophisticated than the filing systems of most small offices or individual managers. But even libraries and knowledge management systems predominately rely on keyword searching for retrieval. For example, if one visits the Web site for the Journal of Management Information Systems at http://jmis.bentley. edu/keywords/, one notes that the only option available for searching (other than browsing the entire collection) is a keyword search. Keyword searching has improved over the years. Knowledge seekers have benefited enormously from the ability to search remotely, the increased speed with which searches are conducted, and the ability of the search mechanism to identify variations of the keywords. Nevertheless, keyword searches have significant limitations. In particular , keyword searches cannot return all relevant documents nor can they filter out irrelevant documents. This article briefly reviews the difficulties associated with keyword searches, especially as the number of documents increases, and proposes a way to overcome those limitations.
منابع مشابه
An Ensemble Click Model for Web Document Ranking
Annually, web search engine providers spend more and more money on documents ranking in search engines result pages (SERP). Click models provide advantageous information for ranking documents in SERPs through modeling interactions among users and search engines. Here, three modules are employed to create a hybrid click model; the first module is a PGM-based click model, the second module in a d...
متن کاملDocument Image Retrieval Based on Keyword Spotting Using Relevance Feedback
Keyword Spotting is a well-known method in document image retrieval. In this method, Search in document images is based on query word image. In this Paper, an approach for document image retrieval based on keyword spotting has been proposed. In proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method is used in this paper to imp...
متن کاملCHARGED SYSTEM SEARCH FOR OPTIMUM DESIGN OF COST-EFFECTIVE STRUCTURAL BEST MANAGEMENT PRACTICES FOR IMPROVING WATER QUALITY
Best Management Practices (BMPs) are implemented in a watershed to reduce the amount of non-point source pollutants transported to water bodies. However, an optimization algorithm is required to choose the efficient type, size, and location of BMPs for application in a watershed for improving the water quality. In this study, the Charged System Search, a well-known and powerful meta-heuristic o...
متن کاملRRLUFF: Ranking function based on Reinforcement Learning using User Feedback and Web Document Features
Principal aim of a search engine is to provide the sorted results according to user’s requirements. To achieve this aim, it employs ranking methods to rank the web documents based on their significance and relevance to user query. The novelty of this paper is to provide user feedback-based ranking algorithm using reinforcement learning. The proposed algorithm is called RRLUFF, in which the rank...
متن کاملEvaluation of Corporate Governance Practices in Emerging Markets (A case study of Nigerian Banking Industry)
This study explores corporate governance practices within the context of the Nigerian banking industry using instances of corporate governance lapses that resulted in part to the Nigerian banking crises. We present multiple case analysis of publicly available documents and court papers (in the United Kingdom and Nigeria) to document instances of breach and areas of weakness in the existing Nige...
متن کاملروش جدید متنکاوی برای استخراج اطلاعات زمینه کاربر بهمنظور بهبود رتبهبندی نتایج موتور جستجو
Today, the importance of text processing and its usages is well known among researchers and students. The amount of textual, documental materials increase day by day. So we need useful ways to save them and retrieve information from these materials. For example, search engines such as Google, Yahoo, Bing and etc. need to read so many web documents and retrieve the most similar ones to the user ...
متن کامل